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Abstract 

^^ We consider the following online allocation problem: Given a unit square S', 

— i^ and a sequence of numbers Ui S {0,1} with X)j=o"'J — ^' ^^ each step i, 

pvi select a region d of previously unassigned area Ui in S. The objective is to 

, make these regions compact in a distance-aware sense: minimize the maximum 

^ I (normalized) average Manhattan distance between points from the same set Ci. 

■^\ Related location problems have received a considerable amount of attention; 

in particular, the problem of determining the "optimal shape of a city", i.e., 

^^ allocating a single rii has been studied. We present an online strategy, based 

on an analysis of space-filling curves; for continuous shapes, we prove a factor 

I""! of 1.8092, and 1.7848 for discrete point sets. 

^\ Keywords: Clustering, average distance, online problems, optimal shape of a 

city, space-filling curves, competitive analysis. 

,o, 

,__! 1. Introduction 

> 

,_^ Many optimization problems deal with allocating point sets to a given envi- 

f~^ ronment. Frequently, the problem is to find compact allocations, placing points 

^N from the same set closely together. One well-established measure is the average 

^1 Li distance between points. A practical example occurs in the context of grid 

^^ computing, where one needs to assign a sequence of jobs i, each requiring an 

^^ (appropriately normalized) number ni of processors, to a subset Ci of nodes of 

^ , a large square grid, such that the average communication delay between nodes 

. . of the same job is minimized; this delay corresponds to the number of grid 

*^ hops [TD], so the task amounts to finding subsets with a small average Li, i.e., 

^^ Manhattan distance. Karp et al. [7] studied the same problem in the context of 

\^ memory allocation. 

^ Even in an offline setting without occupied nodes, finding an optimal allo- 

cation for one set of size n.i is not an easy task; as shown in Fig. [l] the results 
are typically "round" shapes. If a whole sequence of sets have to be allocated, 
packing such shapes onto the grid will produce gaps, causing later sets to be- 
come disconnected, and thus leads to extremely bad average distances. Even 
restricting the shapes to be rectangular is not a remedy, as the resulting prob- 
lem of deciding whether a set of squares (which are minimal with respect to 
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Figure 1: Finding optimal individual shapes. (Left) An optimal shape composed of n=72 grid 
cells, according to 5 . (Right) The optimal limit curve w{x), according to [S]- 



Li average distance among all rectangles) can be packed into a given square 
container is NP-hard 9 ; moreover, disconnected allocations may still occur. 

In this paper, we give a first algorithmic analysis for the online problem. Us- 
ing an allocation scheme based on a space-filling curve, we establish competitive 
factors of 1.8092 and 1.7848 for minimizing the maximum average Manhattan 
distance within an allocated set. 



Related Work 

Compact location problems have received a considerable amount of atten- 
tion. Krumke et al. [S] have considered the offline problem of choosing a set of n 
vertices in a weighted graph, such that the average distance is minimized. They 
showed that the problem is NP-hard (even to approximate); for the scenario in 
which distances satisfy the triangle inequality, they gave algorithms that achieve 
asymptotic approximation factors of 2. For points in two-dimensional space and 
Manhattan distances, Bender et al. [2] gave a simple 1.75-approximation algo- 
rithm, and a polynomial-time approximation scheme for any fixed dimension. 

The problem of finding the "optimal shape of a city", i.e., a shape of given 
area that minimizes the average Manhattan distance, was first considered by 
Karp, McKellar, and Wong [7]; independently. Bender, Bender, Demaine, and 
Fekete [1^ showed that this shape can be characterized by a differential equation 
for which no closed form is known. For the case of a finite set of n points that 
needs to be allocated to a grid, Demaine et al. |5] showed that there is an 0(n^-^) 
dynamic-programming algorithm, which allowed them to compute all optimal 
shapes up to n = 80. Note that all these results are strictly offline, even though 
the original motivation (register or processor allocation) is online. 

Space-filling curves for processor allocation with our objective function have 
been used before, see Leung et al. [10]; however, no algorithmic results and no 
competitive factor was proven. Wattenberg [15] proposed an allocation scheme 
similar to ours, for purposes of minimizing the maximum Euclidean diameter 
of an allocated shape. Like other authors before (in particular, Niedermeier 
et al. lllj and Gotsman and Lindenbaum [S]), he considered a measure called 
c-locality: for a sequence 1, . . . , i, . . . , j, . . . of points on a line, a space-filling 
mapping h{.) will guarantee L2{h{i),h{j)) < cy/\j — i\, for a constant c that 
is y/G sa 2.449 for the Hilbert curve, and 2 for the so-called H-curve. One can 
use c-locality for establishing a constant competitive factor for our problems; 



however, given that the focus is on bounding the worst-case distance ratio for 
an embedding instead of the average distance, it should come as no surprise 
that the resulting values are significantly worse than the ones we achieve. On a 
different note, de Berg, Speckmann, and van der Weele [4] consider treemaps 
with bounded aspect ratio. Other related work includes Dai and Su [5]. 

Our Results 

We give a first competitive analysis for the online shape allocation problem 
within a given bounding box, with the objective of minimizing the maximum 
average Manhattan distance. In particular, we give the following results. 



• 



• 



We show that for the case of continuous shapes (in which numbers Ui cor- 
respond to area) , a strategy based on a space-filling Hilbert curve achieves 
a competitive ratio of 1.8092. 

For the case of discrete point sets (in which numbers indicate the number 
of points that have to be chosen from an appropriate N x N orthogonal 
grid), we prove a competitive factor of 1.7848. 

• We sketch how these factors may be further improved, but point out that 
a Hilbert-based strategy is no better than a competitive factor of 1.3504, 
even with an improved analysis. 

• We establish a lower bound of 1.144866 for any online strategy in the case 
of discrete point sets, and argue the existence of a lower bound for the 
continuous case. 

The rest of this paper is organized as follows. In Section 2, we give some basic 
definitions and fundamental facts. In Section 3, we provide a brief description of 
an allocation scheme based on a space-filling curve. Section 4 gives a mathemat- 
ical study for the case of continuous allocations, proving that the analysis can be 
reduced to a limited number of shapes, and establishes a competitive factor of 
1.8092. Section 5 sketches a similar analysis for the case of discrete allocations; 
as a result, we prove a competitive factor of 1.7848. Section 6 discusses lower 
bounds for online strategies. Final conclusions are presented in Section 7. 

2. Preliminaries 

We examine the problem of selecting shapes from a square, such that the 
maximum average Li-distance of the shapes is minimized. We first formulate 
the problem more precisely. This covers both the continuous and the discrete 
case; the former arises as the limiting case of the latter, while the latter needs 
to be considered for allocations within a grid of limited size. 

Definition 1. A city is a (continuous) shape in the plane with fixed area. For 
a city C of area n, we call 

ciC) = lffff {\x-u\ + \y-v\)dvdudydx (1) 

^ J J J J{x,y),{u,v)eC 



the total Manhattan distance between all pairs of points in C and 

the 0-value or average distance of C . An n-town T is a subset of n points in 
the integer grid. Its normalized average Manhattan distance is 

The normalization with n^'^ yields a dimensionless measure that remains 
unchanged under scaling, and makes the continuous and the discrete case com- 
parable; see [T]. 

Problem 2. In the continuous setting, we are given a sequence rii, 712, ..., rife S 
K+ with X]i=i "■' — 1- Cities Ci, C2, . . . , C^ of size ni,n2, . . . ,nk are to be 
chosen from the unit square, such that maxi<j<fe <j){Ci) is minimized. 

In the discrete setting, we are given a sequence ni,n2, . . ■ ,nk G N+ with 
X]i=i '^i — -^^- Towns Ci,C2,...,Cfe of size ni,n2, . . . ^n^. are to be chosen 
from the N x N grid, such that maxi<j</c (t>{Ci) is minimized. 

Although it has not been formally proven, the offline problem is conjectured 
to be NP-hard, see [TJ]; if we restrict city shapes to be rectangles, there is an 
immediate reduction from deciding whether a set of squares can be packed into 
a larger square 9!. (A special case arises from considering integers, which cor- 
responds to choosing grid locations.) Our approximation works online, i.e., we 
choose the cities in a specified order, and no changes can be made to previously 
allocated cities; clearly, this implies approximation factors for the corresponding 
offline problems. 

There are lower bounds for maxi<i<fc 4>{Ci) that generally cannot be achieved 
by any algorithm. One important result is the following theorem. 

Theorem 3. Let C be any city. Then (f){C) > 0.650245. 

A proof can be found in |lj. For ni — 1 any algorithm must select the 
whole unit square, thus 2/3, the (/)-value of a square, is a lower bound for the 
achievable (h-value. We will discuss better lower bounds in the conclusions. 



3. An Allocation Strategy 

While long and narrow shapes tend to have large 0-values, shapes that fill 
large parts of an enclosing rectangle with similar width and height usually have 
better average distances; however, one has to make sure that early choices with 
small average distance do not leave narrow pieces with high average distance, 
or even disconnected pieces, making the normalized (jj-vahies potentially un- 
bounded. 

Our approach uses the recursive Hilbert family of curves in order to yield a 
provably constant competitive factor. That family is based on a recursive con- 
struction scheme and becomes space-filling for infinite repetition of said scheme 
[12) . For a finite number r of repetitions, the curve traverses all points of the 
used grid. For 1 < r < 3, the curve is shown in Fig. [2] Thus, the Hilbert curve 



provides an order for the cells of the grid, which is then used for allocation, 
as illustrated in Fig. [3J More formal details of the recursive definition of the 
Hilbcrt family (e.g. with text-rewriting rules, such as the ones in [14]) are cited 
and sketched in the following Section |4] 




Figure 2: Hilbert curve with 1 < r < 3. 

More technically, the unit square is recursively subdivided into a grid con- 
sisting of 2'' X 2'' grid cells, for an appropriate refinement level r > 0, as shown 
in Fig. [2] For the sake of concise presentation, we assume that every input 
Ui is an integral multiple of c = 4~^, for an appropriately large R > Q. (We 
will mention in the Conclusions how this assumption can be removed, based on 



Lemma 10 ) Similar to the recursive structure of quad-trees, the actual subdi- 
vision can be performed in a self-refining manner, whenever a grid cell is not 
completely filled. This means that during the course of the online allocation, 
we may use different refinement levels in different parts of the layout; however, 
this will not affect the overall analysis, as further refinement of the grid does 
not change the quality of existing shapes. 

Definition 4. For a give refinement level r, an r-pixel P is a grid square 0/ 
size 2^'' X 2^''. For a given allocated shape Ci, a pixel is full if P Q C^; it is 
fractional, ifPnC.,^9 and P (/L d. 

Now the description of the algorithm is simple: for every input n^, choose 
the next set of ni/2^ i?-pixels traversed by the Hilbert curve as the city Ci, 
starting in the upper left corner of the grid. For an illustration, see Fig. [3] 

The following lemma is a consequence of the recursive structure of the Hilbert 
family; see the following Section|4]for a formal argument. We use it in Section[5] 
for deriving upper bounds. 

Lemma 5. Let C be a city generated by our strategy with area at most n < 
1 4p 4^-R JQj. j' G {0, 1, . . . , i?}, I S N. Then at any refinement level r, C contains 
at most two fractional r-pixels. 
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Figure 3: A sample allocation according to our strategy. 




4. Technical Details of Shape Allocation 

A technical different description of the Hilbert family can be based on the 
string representation given in [T4j . There, the authors use the following recur- 
sion, based on the letters u, r, d, 1 for denoting "up", "right", "down", and 
"left". 



2/1 = urd 

Vn = /i4(y„-i)u2;„-ir2/„_id/i5(2;„_i) ff n > 1 

where /14 and ft.5 are defined as 
if a: = u, 

h4{x) = •!. ' and /i5(x) = 

u if a; = r, 

if a; = 1. 

and hi{xoXiX2 ■ ■ ■) = hi{xo)h,{xi)hi{x2) ...,«€ {4, 5}. 

We combine those characters to four sequences of length three and build 
an L-system, a special kind of formal grammar with parallel rewriting. Let 
A = urd, B = Idr, C — rul, and D = dlu, as shown in Figure |4] Using the 
production rules 

A -^ CAAB, 

B -^ DBBA, 

C ^ ACCD and 

D -^ BDDC 

and starting with A, you get the same string from [TJ if you replace A0A1A2A3 
by XouXirX2dX3, Aq, Ai, A2, A3 e {A,B,C,D}, after each use of a produc- 
tion rule. 

A sequence of symbols produced by the L-system can be interpreted graph- 
ically as a sequence of sub-squares K^-iblifc], j, fc G {1, . . . , S"""^}, see Figure 51 
The figure also shows that ^ is I? rotated by 180°. The same is true for B and 
C. We denote by A the symbol that is A rotated by 180°. As we are only 
interested in the shape of cities, we identify symmetric cities. Consequently, we 
make no distinction between A and A when we look at a single symbol. We 
write A = A. Similarly, we get equivalences for longer sequences of symbols: 
XY = YX, because the order of the successive sub-squares i?r_i[j][fc] does not 
change the shape. Furthermore, we have A A = YY for each A and F, as it is 
always the simple shape followed by the rotated shape, and AA = AA for two 
occurrences of the same shape in succession. 

Lemma 6. After r > 1 uses of the production rules (where r = 1 means that we 
are still at the start symbol A), the resulting sequence contains all sub-sequences 
of length 2^~^ that can be created with the L-system, except for symmetry. 

Proof: We prove the claim via induction over r: 
r = 2 : CAAB contains C, B, and A = A — D, i.e., all sequences of length 1. 



r = 3 : ACCDCAABCAABDBBA contains 

• AA = DD 

• AB = BA = DC=CD 

• AC = CA = DB = BD 

• BC=CB = AD = DA 

• BB = CC\ 

i.e, all possible sequences of length 2. 

r —>■ r + l,r > 3 : A sequence of length 2'""^ has been created from a sequence 
of length at most 2''~^/4+ 1 = 2^~^ + 1, because every symbol is replaced 
by exactly 4 symbols. If the claim holds for r > 3, then all sequences 
of length 2*""^ have been created, i.e., all sequences of length 2'^~^ + 1 
have been created as well. Thus, in the next step the production rules are 
applied to all possible sequences of that length, yielding all sequences of 
length 2''^^ that can be created with the L-system. D 



Lemma 7. 

j,ke {1,.. 



n 

A city Wn contains at most parts of In/A] +1 sub-squares E^_i[j] [k] , 
,2''"^}, i.e., sub-squares that consist of exactly 4 cells. 



Proof: A city Wn consists of exactly n cells. Assume that these cells belong 
to at least [n/4] + 2 sub-squares i?^_]^[j][A:]. Then at least three of those sub- 
squares cannot belong to Wn as a whole but only in part, because n cells cannot 
completely fill more than [r7,/4] of the sub-squares, and if two more are partially 
filled not even all those can be filled completely. Consider the sequence of sub- 
squares of Wn in the order given by the Hilbert curve. One of the sub-squares 
that is not the first or the last in the sequence cannot completely belong to Wn- 
This is a contradiction to the definition of the Hilbert curve, which recursively 
uses the same construction scheme for sub-squares on every level of refinement. 

D 

Lemma 8. When the L-system has generated all sequences of length [n/4] +1, 
the resulting Hilbert curve traverses a city Wn. 

Proof: Each symbol of the L-system corresponds to a sub-square E^__^ [j] [k] . 
Once all sequences of length [n/4] -(- 1 symbols have been generated, all possi- 
ble cities consisting of [n/4] -f 1 sub-squares have been generated, too. With 
Lemma [7] the claim follows. D 

Lemma 9. For r — [log2([n/4] +1)] +2 the Hilbert curve traverses a city Wn- 



Figure 4: A, B, C, and D 



Proof: Using Lemmas [6] and [8] we know that the Hilbert curve traverses a 
city Wn, if the fohowing holds: 

2'-2 > [n/4] + 1 
This is true for 

r=riog2(rn/4l+l)l+2. 

n 



5. Analysis 

For the analysis of our allocation scheme we will first make use of Lemma [5] 
As noted in Lemma [TOJ filling in the two fractional pixels of an allocated shape 
yields an estimate for the total distance at a coarser refinement level. In a sec- 
ond step, this will be used to derive global bounds by computing the worst-case 
bounds for shapes of at most refinement level 3; thus the presented computa- 
tional results for shapes of limited size are not merely experiments, but yield a 
general upper bound on the competitive factor. (As discussed in the Conclu- 
sions, carrying out the computations on a lower or higher refinement level gives 
looser or tighter results.) 

In the following, we denote by Wn the worst city consisting of n pixels that 
can be produced by our Hilbert strategy; because of the normalized nature of 
0, this is independent on the size of the pixels, and only the shape matters. 

Lemma 10. Let C be a city generated by our strategy with area at most n < 
WA-^ forr (^ {0, 1, . . . , i?}, / G N. Then we have c(C) < c{Wi+i), where Wi+i 
is a worst case among all cities produced by our allocation scheme that consists 
of (l + 1) r-pixels. 

Proof: By Lemma [5j we know that only the first and the last pixel of C 
may be fractional. Therefore C cannot intersect more than / + 1 r-pixels. By 
replacing the two fractional pixels by full pixels, we get a city W that consists 
of / + 1 fuh r-pixels, and c[C) < c{W). By definition, c{W) < c{Wi+i), and the 
claim holds. 

Therefore, we can give upper bounds for the worst case by considering the 
values of Wn at some moderate refinement level. The Wn can be found by 
enumeration; as described by the technical lemmas in preceding section, a speed- 
up can be achieved by making use of the recursive construction of the Wn- We 
determined the shapes and 0- values of the Wn for n < 65; by Lemma [TOJ this 
suffices to provide upper bounds for all cities with area up to 64*2"'', i.e., these 
computational results give an estimate for the round-up error using refinement 
level 3. The full table of average distances for this refinement level can be seen 
in Table [T] the worst cases among the examined ones are W56 and 14^14, which 
have the same shape, shown in Fig. [5] 

Theorem 11. Our strategy guarantees maxi<„<fe (j){Cn) < 1.1764. 



Figure 5: Worst cases Wn for 12 < n < 17. 



Table 1: Total and average distances for cities Wn allocated according to our strategy, as well 
as the optimal values Cdtyin) according to [5]- 
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0.8395 
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0.8376 
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0.8517 
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0.8074 
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0.8202 


0.9393 
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0.8446 
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13 
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0.8621 
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0.8781 
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0.8425 


0.9439 


15 


371^3 


290 V3 


0.8530 




48 
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5234 


0.8446 


0.9505 


16 


434^3 


338^3 


0.8490 


1.1764 


49 


7147 73 


550773 


0.8505 


0.9512 


17 


512^3 


396 V3 


0.8605 


1.1497 


50 


7586 V3 


5788 


0.8583 


0.9516 


18 


602 V3 


457 V3 


0.8764 


1.1174 


51 


7993 


6076 73 


0.8606 


0.9559 


19 


685 


522 V3 


0.8706 


1.1058 


52 


8411 V3 


6368 


0.8628 


0.9620 


20 


768 


591 V3 


0.8587 


1.1098 


53 


8878 


669173 


0.8683 


0.9619 


21 


870 


663 


0.8610 


1.0903 


54 


9379 V3 


701773 
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0.9617 


22 
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749 V3 


0.8745 
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55 
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0.9225 
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0.8061 
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0.8527 
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0.9710 


63 


13415 


10305 73 


0.8517 
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31 


2213 


1759^3 


0.8272 


0.9726 


64 


1392473 


1071973 


0.8499 




32 


2393 V3 


1898^3 


0.8263 


0.9791 


65 


14452^/3 


11139 ''/3 


0.8486 




33 


2602 


2057 


0.8319 


0.9735 













Proof: Consider a city C of size n generated by our strategy. If n is 
sufficiently small, i.e., smaller than an i?— r-pixel, r >0, C consists of at most 
4'' cells and its average distance can be bounded by the worst case for that 
particular number of cells. In the case that C has a larger, more complicated 
shape, an analysis of a finite number of shapes is still sufficient: 

We know that n > A^c and can assume that n < 4:^~^^c (or else we use the 
analysis on the less refined E'^_,-_^j^.[p][q]). Thus, there must be an I such that 

Wc < n < {I + l)4-'c with I ~ 1, . . . , 3. Yet, we can get closer to n, as we 
know that i?^_ [p][q] consists of 4-' cells. We get the inequality W"'' < n < 
{I + l)4J-'=c, fc < j, / = 4^=, . . . , 4*^+1 - I. 

Hence, a city of arbitrary size n corresponds to at most {I + I) sub-squares 



of a certain size (depending on the precision of the analysis), i.e., a city of size 
at most (Z + l)4-'~'^c. Now we can use Lemma ^ to bound the average distance 
of the city, yielding 



2c{W) ^ ^(W^i+2)(a + 2)4^-fec)5/2 



= <l^{Wi+2)[l + -A =:HWi). (5) 

The resuhing bound is max({0(VFi) : 1 < i < ^^}\J{<^{Wi) : A'' < I < 4'=+i-l}). 
Note that the number of shapes considered is at most 4'^+^. 

We conducted the calculations for fc = 2 and list the results in Table [TJ As 
it turns out, the maximum is attained for ^(Wie) = 1.1764. D 

Corollary 12. Our strategy achieves a competitive factor of 1.8092. 

Proof: According to Theorem [3] no algorithm can guarantee a better (f>- 
value than 0.650245. Our strategy yields an upper bound of 1.1764. This results 
in a factor of 1.1764/0.650245 « 1.8092. D 

6. Discrete Point Sets 

Our above analysis relies on continuous weight distributions, which imply 
the lower bound on </>- values stated in Theorem 1. This does not include the 
discrete scenario, in which the values Ui indicate a number of integer grid points 
that have to be chosen from an appropriate TV x A^-grid. As discussed in the 
paper [S], considering discrete weight distributions may allow lower average 
distances; e.g., a single point yields a 0- value of 0. As a consequence, towns 
(subsets of the integer grid) have lower average distances than cities of the same 
total weight. However, we still get a competitive ratio for the case of online 
towns. 

Theorem 13. For n-towns, a Hilbert-curve strategy guarantees a competitive 
factor of at most 1.7848 for the (j}-value. 



Proof: Lemma [5] still holds, so analogously to Theorem 11_ we consider 



the values up to n = 64, and show that the worst case is attained for n — 16, 
which yields an upper bound of 1.123. See Table [2] for detailed numbers. 

For a lower bound, the general value of 0.650245 for ^-values cannot be 
applied, as discrete point sets may have lower average distance. Instead, we 
verify that the ratio p{n) of achieved to optimal cj), is less than 1.7848; this 
is the same as ciTn) / Ctown{n) for n < 64, see Table [2J For 65 < n < 80, the 
optimal values in [5] allow us to verify that > 0.6292; see our Table Isj 

Thus, we have to establish a lower bound for for n > 81. We make use 
of equation (5), p. 89 of |S]; see Fig. |6] for a given number n of grid points, 
the difference between the optimal total Manhattan distance Cdtyin) for a city 
consisting of n unit squares and the optimal total distance Ctown ("•) for a town 

consisting of n grid points is equal to A(n) :— | I ^ • cf + ^ ■ r^ ) , where Ci is 
the number of grid points in column i, and rj is the number of grid points in 
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cj = 5 


C2 = 3 


C3 =2 


C4 = 1 


C5 =0 


5 = 1 






4 = 1 






3 = 2 








2 = 3 










1 = '1 













Figure 6: Establishing a lower bound for ip- Defining A(n); an arrangement that maximizes 

A(n). 



row j. Because ""'a^i" is bounded from below by V' = 0.650245, we get a lower 



bound of ip 



2A(«) ^ 2ct„„„(n) 



for the d-valne of an n-town. 



This leaves the task of providing an upper bound for 2A(n)/n^'^. According 
to Lemma 5 of [5|, the bounding box of an optimal n-town does not exceed 
2y/n + 5. Therefore, we have q < 2y^ + 5; as ^^ Ci = n and the function ^^ cf 
is superlinear in the Ci, we conclude that ^^ cf is maximized by subdividing n 
into 



columns of 2y^+5 points each, so J2i S < n(2Y^+5). Analogously, 
we have X),^? < n{2^ + 5), so 2A{n)/n'^-^ < |(^ + ^). For n > 81, this 



2yri+5 



3 ^ n 



implies 2A{n)/n^-^ < 2i3 + Sll? = 0.0210333... or 0(n) > 0.6292. This yields 

an overall competitive ratio of not more than 1.123/0.6292, i.e., 1.7848. D 

A more refined analysis of A(n) considers maximizing ^^ cf + X^i ^? ^^1 ^^ 

once, instead of ^- cf and ^ r^ separately, for a maximum value of n(2y^ + 



5) + 5^7=^. For n > 81, this yields 2A(n)/; 



,2.5 



< 



243 ' 2187 



+ g|j- =0.0137373. 



As the resulting competitive ratio of 1.7643 is only very slightly better, we 
omit further details. If instead we rely on the unproven conjecture in [S] that 
^'^'a.'s" « "0 — 2;^^, we get (f> > 0.6451, which corresponds to experimental 
evidence; the resulting competitive factor is 1.7406. 



7. Lower Bounds 

We demonstrate that there are non-trivial lower bounds for a competitive 
factor. We start by considering the discrete online scenario for towns. 

Theorem 14. No online strategy can guarantee a competitive factor below -^ = 
1.144866.... 

Proof: Consider a 3 x 3 square, and let ni — 4; see Fig. [tI If (a) the 
strategy allocates a 2 x 2 square (for a total distance of 8), then n2 = 5, and 
the resulting L-shape has a total distance of 20 and a (/)-value of 40/5^'^ = 
0.715541... Allocating (b) the first town with an L-shape of total distance 10 
results m (f) ~ 20/32 = 0.625, and the second with a total distance of 16, or 
(j> = 32/525 = 0.572433... 
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Table 2: Total distances 
ing optimal values ctown 



for towns T„ allocated according to our strategy, and the correspond- 
for n-towns. 



n 


c(T„) 


cta^^in) 


P(") 


0(T„) 


*(T„) 


n 


c(T„) 


cto»„(n) 


P(") 


'P(T,^) 


*(T„) 


1 








- 


0.0000 




34 


2763 


2153 


1.2833 


0.8198 


0.9453 


2 


1 


1 


1.0000 


0.3536 




35 


2968 


2318 


1.2804 


0.8191 


0.9482 


3 


4 


4 


1.0000 


0.5132 




36 


3186 


2486 


1.2816 


0.8194 


0.9550 


4 


10 


8 


1.2500 


0.6250 




37 


3436 


2656 


1.2937 


0.8252 


0.9511 


5 


20 


16 


1.2500 


0.7155 




38 


3713 


2847 


1.3042 


0.8342 


0.9473 


6 


33 


25 


1.3200 


0.7485 




39 


3960 


3040 


1.3026 


0.8338 


0.9366 


7 


44 


38 


1.1579 


0.6788 




40 


4216 


3241 


1.3008 


0.8333 


0.9263 


8 


66 


54 


1.2222 


0.7292 




41 


4448 


3446 


1.2908 


0.8265 


0.9112 


9 


92 


72 


1.2778 


0.7572 




42 


4687 


3662 


1.2799 


0.8200 


0.9017 


10 


123 


96 


1.2812 


0.7779 




43 


4904 


3886 


1.2620 


0.8089 


0.9112 


11 


148 


124 


1.1935 


0.7376 




44 


5154 


4112 


1.2534 


0.8027 


0.9203 


12 


194 


152 


1.2763 


0.7778 




45 


5524 


4360 


1.2670 


0.8133 


0.9205 


13 


244 


188 


1.2979 


0.8009 




46 


5909 


4612 


1.2812 


0.8235 


0.9212 


14 


301 


227 


1.3260 


0.8209 




47 


6252 


4868 


1.2843 


0.8257 


0.9260 


15 


348 


272 


1.2794 


0.7987 




48 


6610 


5128 


1.2890 


0.8282 


0.9331 


16 


410 


318 


1.2893 


0.8008 


1.1230 


49 


7012 


5398 


1.2990 


0.8344 


0.9339 


17 


488 


374 


1.3048 


0.8191 


1.1001 


50 


7447 


5675 


1.3122 


0.8425 


0.9347 


18 


575 


433 


1.3279 


0.8366 


1.0708 


51 


7848 


5960 


1.3168 


0.8450 


0.9393 


19 


656 


496 


1.3226 


0.8338 


1.0626 


52 


8262 


6248 


1.3223 


0.8474 


0.9458 


20 


736 


563 


1.3073 


0.8229 


1.0700 


53 


8724 


6568 


1.3283 


0.8532 


0.9459 


21 


836 


632 


1.3228 


0.8273 


1.0530 


54 


9221 


6890 


1.3383 


0.8606 


0.9460 


22 


957 


716 


1.3366 


0.8431 


1.0360 


55 


9672 


7222 


1.3392 


0.8623 


0.9414 


23 


1064 


804 


1.3234 


0.8388 


1.0091 


56 


10136 


7556 


1.3415 


0.8638 


0.9370 


24 


1176 


895 


1.3140 


0.8335 


0.9831 


57 


10560 


7896 


1.3374 


0.8610 


0.9295 


25 


1280 


992 


1.2903 


0.8192 


0.9472 


58 


10995 


8243 


1.3339 


0.8583 


0.9224 


26 


1387 


1091 


1.2713 


0.8048 


0.9388 


59 


11400 


8604 


1.3250 


0.8527 


0.9123 


27 


1480 


1204 


1.2292 


0.7814 


0.9483 


60 


11816 


8968 


1.3176 


0.8475 


0.9086 


28 


1618 


1318 


1.2276 


0.7800 


0.9570 


61 


12196 


9354 


1.3038 


0.8393 


0.9098 


29 


1796 


1442 


1.2455 


0.7931 


0.9486 


62 


12669 


9749 


1.2995 


0.8371 


0.9068 


30 


1985 


1570 


1.2643 


0.8054 


0.9437 


63 


13220 


10146 


1.3030 


0.8393 


0.9051 


31 


2148 


1704 


1.2606 


0.8029 


0.9464 


64 


13724 


10556 


1.3001 


0.8376 




32 


2326 


1840 


1.2641 


0.8031 


0.9540 


65 


14256 


10972 


1.2993 


0.8370 




33 


2532 


1996 


1.2685 


0.8095 


0.9489 















Table 3: </>- values of optimal n-towns, calculated using Table 1 from \5\. 



n 


<t>opt{n) 


n 


4>opt{n) 


n 


(f)opt{n) 


n 


4>opt{n) 


65 


0.644217 


69 


0.643676 


73 


0.645275 


77 


0.645053 


66 


0.644281 


70 


0.644399 


74 


0.645136 


78 


0.645234 


67 


0.644240 


71 


0.645067 


75 


0.645072 


79 


0.645524 


68 


0.644104 


72 


0.645317 


76 


0.644715 


80 


0.645595 



If instead, (c) the first town is allocated different from a square, the total 
distance is at least 10, and (j) > 20/32; then (d) n2 ~ nj, ~ n4 = n^ ^ n^ ~ 1, 
and an optimal strategy can allocate the first town as a 2x2 square, with (j) — 0.5. 
This bounds the competitive ratio, as claimed. D 

For the case of continuous allocations, we claim the following. 

Theorem 15. There is 6 > 0, such that no online strategy can guarantee a 
competitive factor 1 + S. 

Proof: Consider rii = 1/2, in combination with the two possible scenarios 

(a) n2 - 1/2; 

(b) n2 = 713 = . . . = e. 

In scenario (a), an adversary can assign two (1 x l/2)-rectangles, for a 0- value 
of 0.707...; in scenario (b), an adversary can assign all shapes as squares, for a 
0-value of 0.666... If the player chooses a square size a/2/2 first, the adversary 
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(a) 








(b) 

































(c) 



(d) 



Figure 7: The four cases considered in Theorem |14| the left column shows the choices by an 
algorithm, the right the corresponding optimal choices for the ensung sequence. 



(a) (b) (c) 

Figure 8: The scenarios considered in Theorem |15[ and a possible choice for the player. 

can choose scenario (a), causing the second aUocation to be in L-shape with 
(/)-value |(7 - 4^2) = 0.895431..., as opposed to the optimal value of 0.707... If 
the player chooses a (1 x l/2)-rectangle first, the adversary chooses scenario (b), 
for a ratio of 1.06066... The existence of the claimed lower bound follows from 
continuity, as the cjj-vahie changes continuously with continuous deformation of 
the involved shapes. D 

The precise value arising from the scenarios in Theorem[T5]is complicated. It 
can be obtained by computing the optimal intermediate value for the player that 
allows him to protect against both scenarios at once. For example, optimizing 
over the family of allocations shown in Figure ^ (c) yields a competitive ratio 
that is better than 1.06; however, the player may do even better by using curved 
boundaries. The involved computational effort for the resulting optimization 
problem promises to be at least as complicated as computing the "optimal 
shapes of a city" , for which no closed-form solution is known, see [71 [T] . 

8. Conclusions 

We have established a number of results for the online shape allocation 
problem. In principle, further improvement could be achieved by replacing the 
computational results for level 3 (i.e., n = 16,..., 64) by level 4 (i.e., n — 
65, . . . , 256). (Conversely, a simplified analysis with level 2, i.e., n = 4, . . . , 16; 
yields a worse factor of 3.6525.) However, the highest known optimal ^-values 
are for n = 80, obtained by using the 0(n^'^) algorithm of [5]. In any case, 
there is a threshold of 1.3504 for Hilbert-based strategies, which we believe 
to be tight: this is the ratio between the upper bound of 0.8768 for n = 14 
(and for n = 56, 224, . . .) and the asymptotic lower bound of 0.650245; because 
asymptotically, continuous and discrete case converge, this also applies to the 
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discrete case. Other open probleras are to raise the lower bound of 1.144866 for 
the discrete case, and establish definitive values for the continuous case. 

As noted in Section 3, we can eliminate the assumption of all Ui being 



multiples of some 2^^, by making use of Lemma 10, and allocating a small 
round-off fraction to a fractional pixel maintains the same bounds. However, the 
formal aspects of describing the resulting allocation scheme become somewhat 
tedious and would require more space than seems appropriate. 

The offline problem is interesting in itself: for given n^, allocate disjoint re- 
gions of area rii in a square, such that the maximum average Manhattan distance 
for each shape is minimized. As mentioned, there is some indication that this is 
an NP-hard problem; however, even relatively simple instances are prohibitively 
tricky to solve to optimality, making it hard to give a formal proof. Clearly, 
our online strategy provides a simple approximation algorithm; however, better 
factors should be possible by exploiting the a-priori information of knowing all 
rii, e.g., by sorting them appropriately. 




Figure 9: A possible worst-case scenario for the offline problem. 

Another interesting open question for the offline scenario is the maximum 
optimal (/>-value for any set ni, . . . , n^. A simple lower bound is 2/3 = 0.666..., 
as that is the average distance of the whole square. A better lower bound is is 
provided by dividing the square into two or three equal-sized parts. For the case 
ni = n2 ~ 1/2, we can use symmetry and convexity to argue that an optimum 
can be obtained by a vertical split, yielding (j) = •\/2/2 = 0.707. We believe the 
global worst case is attained for fii = n2 = n^ = 1/3. Unfortunately, it is no 
longer possible to exploit only symmetry for arguing global optimality. Figure [9] 
shows an allocation with cj) = 0.718736... for all three region^ We conjecture 
that this is the best solution for m — n2 — n^ — 1/3, as well as the worst case 
for any optimal partition of the unit square. 
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^More precisely, the involved values can be expressed as a = j^ (55 — ^^ 

,77- 

1/3 



_ (9602477-13416v'585705)e+(202679+204V585705)i^2+82133e^ 

^" 77760\/3e ™' 
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